Search results for "part of speech"
showing 10 items of 17 documents
De Praepositione: The Emerging of Donatus’s Thought on Syntax?
2020
The paper investigates the treatment of syntax in late Antiquity through an analysis of the description of prepositions in Donatus’s Artes. The Roman tradition of grammatical studies in late Antiquity lacks any organic, dedicated description of syntax. However, reflections on syntax emerge from the grammarians’ descriptions of the parts of speech. The ultimate purpose of this paper is to understand whether, and to what extent, traces of emerging thought on syntax can be found in Donatus’s description of prepositions, which modern linguistic theory regards as a syntactic object of study. To that end, the paper focuses on the textual …
Word classes and the scope of lexical flexibility in Tongan
2017
Tongan is an Oceanic language belonging to the Polynesian subgroup. Based on previous work (Churchward 1953, Tchekhoff 1981, Broschart 1997), Tongan has been classified as a 'flexible' language by various typological approaches to word classes (Hengeveld 1992, Rijkhoff 1998, Croft 2001). This means that lexical items are not per se categorised in terms of major word classes, but can function as noun, verb, adjective and manner adverb without morphosyntactic derivation. However, not all lexemes are entirely flexible, that is, able to occur in all of these constructions. So the crucial question of how flexible Tongan really is remains. This question will be addressed by a survey based on a comb…
Beyond decomposition: Processing zero-derivations in English visual word recognition
2019
Four experiments investigate the effects of covert morphological complexity during visual word recognition. In English zero-derivations, a change of word class occurs without any change in surface form (e.g., a boat/to boat; to soak/a soak). Boat is object-derived and is a basic noun (N), whereas soak is action-derived and is a basic verb (V). Because the suffix {-ing} attaches only to verbs, deriving boating from its base requires two steps, boat(N) > boat(V) > boating(V), whereas soaking can be derived in one step from soak(V). Experiments 1 to 3 used masked priming at different prime durations to test matched sets of one- and two-step verbs for morphological (soaking-SOA…
A Metric for Automatic Word categorization
2008
This paper presents a metric used by the working prototype WIH (Web Intelligent Handler). This metric (referred to here as po) is designed to reflect main topic words and to discriminate certain text profiles through word weightings. The current version is designed only for Spanish web texts. Statistical analyses show that text profiles can be differentiated by the behavior of po. A poll is also presented, showing that the metric is a good discriminator of main words. This paper is posted here as a new algorithm useful for Spanish text processing.
Integrating Computational Linguistic Analysis of Multilingual Learning Data and Educational Measurement Approaches to Explore Learning in Higher Educ…
2017
This chapter develops a computational linguistic model for analyzing and comparing multilingual data as well as its application to a large body of standardized assessment data from higher education. The approach employs both an automatic and a manual annotation of the data on several linguistic layers (including parts of speech, text structure and content). Quantitative features of the textual data are explored that are related to both the students’ (domain-specific knowledge) test results and their level of academic experience. The respective analysis involves statistics of distance correlation, text categorization with respect to text types (questions and response options) as well as lang…
Understanding Romanian Texts by Using Gamification Methods
2019
In recent years, there has been increasing interest in the field of natural language processing. Determining which syntactic function is right for a specific word is an important task in this field, useful for a variety of applications such as text understanding, automatic translation, question answering, and even e-learning systems. In Romanian, this task is even harder because of the complexity of the grammar. The present paper falls within the field of “Natural Language Processing”, but it also draws on other concepts such as “Gamification”, “Social Choice Theory” and “Wisdom of the Crowd”. There are two main purposes for developing the a…
Part-of-speech labeling for Reuters database
2015
Although the Vector Space Model used for document representation in information retrieval systems incorporates little knowledge, it continues to be used because of its low computational cost, execution speed, and simplicity. We try to improve this document representation by adding syntactic information such as parts of speech. In this paper, we evaluate three different tagging algorithms in order to select the most suitable tagger for the Reuters dataset. We evaluate the taggers using only five parts of speech: noun, verb, adverb, adjective, and other. We considered these particular tags to be the most representative for describin…
Precategoriality and syntax-based parts of speech
2008
Late Archaic Chinese is a precategorial language, i.e., a language whose lexical items are not preclassified in the lexicon for the syntactic functions of N and V. This will be shown on the basis of structural-conceptual criteria such as those developed by Croft (2000) and Sasse (1993b) as well as on the basis of methodological criteria such as those suggested by Evans & Osada (2005). As is claimed in Construction Grammar (Goldberg 1995, 2005), the meaning of lexical items is derived by integrating their own lexical meaning with the meaning contributed by the construction. The construction analysed in this paper is the argument structure construction. Linking between lexicon and syntax is subject…
La linguistique des grammaires françaises publiées en Espagne dans la première moitié du XIXe siècle
2005
In this article, we examine a corpus of 13 grammars for teaching French to Spaniards, published in the first half of the nineteenth century. In a cross-sectional analysis, we consider (1) the sources cited by the authors; (2) the nature of the definition of grammar and the number of parts of speech; (3) the definition of the noun (with or without the canonical declension schema or specific classes of this element); (4) the definition of the verb (with or without the canonical categories); and (5) syntax. Our aim is to determine the explicit and implicit linguistics of these school grammars, at a time when …
From Adjectives to Quantifiers. Evidence from Old and Modern Catalan
2011
The history of indefinite quantifiers in the Romance languages is the history of the development of new distributional features by a select group of Latin adjectives (Company 1991, 1997; Batllori 1998). This distribution is responsible for the later constitution of the new subclass of determiners that we call quantifiers. As explained in Zamparelli (2000), the increase in the structural complexity of the left edge of the DP entails the specialization of words that belonged to older categories for these new positions. This is how the quantifier position generates the quantifier category. We are facing a long-term process with consequences that are still …